LINEAR ALGEBRA AND BOOTSTRAP PERCOLATION

JÓZSEF BALOGH, BÉLA BOLLOBÁS, ROBERT MORRIS, AND OLIVER RIORDAN



Abstract. In $\mathcal{H}$-bootstrap percolation, a set $A \subset V(\mathcal{H})$ of initially
'infected' vertices spreads by infecting vertices which are the only uninfected vertex in an
edge of the hypergraph $\mathcal{H}$. A particular case of this is the $H$-bootstrap process, in
which $\mathcal{H}$ encodes copies of $H$ in a graph $G$. We find the minimum size of a set $A$
that leads to complete infection when $G$ and $H$ are powers of complete graphs and
$\mathcal{H}$ encodes induced copies of $H$ in $G$. The proof uses linear algebra, a technique
that is new in bootstrap percolation, although standard in the study of weakly saturated
graphs, which are equivalent to (edge) $H$-bootstrap percolation on a complete graph.



1. Introduction 



Given a hypergraph $\mathcal{H}$, the $\mathcal{H}$-bootstrap process is defined as follows. Let
$A \subset V(\mathcal{H})$ be a set of initially 'infected' vertices, and, at each time step,
infect a vertex $u$ if it lies in an edge of $\mathcal{H}$ in which all vertices other than $u$
are already infected. To be precise, set $A_0 = A$, and, for each $t \ge 0$, set

$$A_{t+1} := A_t \cup \{u : \exists\, S \in \mathcal{H} \text{ with } S \setminus A_t = \{u\}\}.$$

Let $[A]_{\mathcal{H}} = \bigcup_{t \ge 0} A_t$, and say that $A$ percolates (or
$\mathcal{H}$-percolates) if $[A]_{\mathcal{H}} = V(\mathcal{H})$.
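The update rule above is straightforward to simulate. The following is a minimal Python sketch, not taken from the paper: the function name `bootstrap_closure`, the representation of the hypergraph as a list of vertex sets, and the toy example are all assumptions made for illustration. It updates edge by edge rather than in synchronous rounds, which gives the same closure because the process is monotone.

```python
from typing import FrozenSet, Hashable, Iterable, Set


def bootstrap_closure(edges: Iterable[FrozenSet[Hashable]],
                      initial: Iterable[Hashable]) -> Set[Hashable]:
    """Return the closure [A]_H of the initial set under the bootstrap rule."""
    edge_list = [frozenset(e) for e in edges]
    infected = set(initial)
    changed = True
    while changed:
        changed = False
        for edge in edge_list:
            missing = edge - infected
            if len(missing) == 1:      # exactly one uninfected vertex in this edge
                infected |= missing
                changed = True
    return infected


# Toy hypergraph on {1, ..., 5} (hypothetical example, not from the paper).
toy_edges = [frozenset({1, 2, 3}), frozenset({3, 4}), frozenset({2, 4, 5})]
closure = bootstrap_closure(toy_edges, {1, 2})
```

In this toy example the set {1, 2} percolates, while {1} alone infects nothing further.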

A large family of models of this type was introduced in [9]. Given graphs $G$ and $H$,
we obtain the $H$-bootstrap process on $G$ by setting
$\mathcal{H} = \{V(H') : H' \subset G \text{ and } H' \cong H\}$. (Sometimes it is more natural to
consider only induced copies of $H$ in $G$.) The $\mathcal{H}$- and $H$-bootstrap processes can
be seen as special cases of the 'cellular automata' introduced by von Neumann (see [16])
after a suggestion of Ulam [17], and generalize several previously studied models. For
example, if $G$ is a (finite) square grid and $H = C_4$ then we obtain the so-called
'Froböse process' (see [12] or [13]).

A fundamental question about bootstrap-type models is the following: given a
hypergraph $\mathcal{H}$ (or a pair $(G,H)$), how large is the smallest percolating set in the
$\mathcal{H}$-bootstrap process? We define

$$m(\mathcal{H}) := \min\{|A| : A \subset V(\mathcal{H}),\ [A]_{\mathcal{H}} = V(\mathcal{H})\}.$$

Let $K_n^d$ denote the graph with vertex set $[n]^d = \{1,\dots,n\}^d$ in which $uv$ is an edge
if $u$ and $v$ differ in exactly one coordinate. Given $1 \le r \le d$ and $2 \le t \le n$, let
$\mathcal{K}(n,d,t,r)$ be the hypergraph with vertex set $[n]^d$ in which the edges are all sets
$S$ of the form $S = I_1 \times I_2 \times \cdots \times I_d$ where $r$ of the sets
$I_j \subset [n]$ have size $t$ and the others are singletons. Equivalently,

$$E(\mathcal{K}(n,d,t,r)) := \{S \subset [n]^d : K_n^d[S] \cong K_t^r\},$$

the collection of induced copies of $K_t^r$ in $K_n^d$. Note that $K_2^r = Q_r$, the
$r$-dimensional hypercube.

Our main aim is to determine $m(\mathcal{K}(n,d,t,r))$ precisely for every $n \ge t \ge 2$ and
$d \ge r \ge 1$. We shall also consider the grid $P_n^d$ with vertex set $[n]^d$, in which two
vertices are adjacent if they differ by 1 in one coordinate, and agree in all others.
(This graph is usually denoted $[n]^d$, but here this notation would cause confusion.)
The corresponding hypergraph $\mathcal{P}(n,d,t,r)$ has as edges all sets
$S = I_1 \times I_2 \times \cdots \times I_d$ where $r$ of the $I_j$ are intervals of size $t$,
and the rest are singletons. Note that while such sets $S$ induce copies of $P_t^r$ in $P_n^d$,
they are not the only induced copies. (There can also be 'bent' copies.) Clearly
$\mathcal{P}(n,d,t,r) \subset \mathcal{K}(n,d,t,r)$. As noted in Section 2 below, the set of points

$$U = \{(u_1,\dots,u_d) \in [n]^d : |\{i : u_i \ge t\}| \le r-1\} \qquad (1)$$

percolates in $\mathcal{P}(n,d,t,r)$, and hence also in $\mathcal{K}(n,d,t,r)$. Our main theorem
implies that the set $U$ is extremal in both hypergraphs.

Theorem 1. For every $n \ge t \ge 2$ and $d \ge r \ge 1$,

$$m(\mathcal{K}(n,d,t,r)) = m(\mathcal{P}(n,d,t,r)) = \sum_{s=0}^{r-1} \binom{d}{s} (t-1)^{d-s} (n+1-t)^s.$$
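For very small parameters the statement of Theorem 1 can be checked by exhaustive search. The sketch below is an illustration only and not part of the paper; the helper names (`theorem1_value`, `box_edges`, `percolates`, `min_percolating_size`) are assumptions, and the search is exponential in $n^d$, so it is only usable for tiny cases such as $n = 3$, $d = 2$.

```python
from itertools import combinations, product
from math import comb


def theorem1_value(n, d, t, r):
    """Right-hand side of Theorem 1."""
    return sum(comb(d, s) * (t - 1) ** (d - s) * (n + 1 - t) ** s for s in range(r))


def box_edges(n, d, t, r, intervals_only):
    """Edges I_1 x ... x I_d with r factors of size t and the rest singletons.

    intervals_only=False gives K(n,d,t,r); intervals_only=True gives P(n,d,t,r).
    """
    values = range(1, n + 1)
    if intervals_only:
        big = [tuple(range(a, a + t)) for a in range(1, n - t + 2)]
    else:
        big = list(combinations(values, t))
    small = [(a,) for a in values]
    edges = set()
    for varying in combinations(range(d), r):
        factors = [big if k in varying else small for k in range(d)]
        for choice in product(*factors):
            edges.add(frozenset(product(*choice)))
    return list(edges)


def percolates(edges, initial, num_vertices):
    """Run the bootstrap rule to its closure and test for full infection."""
    infected = set(initial)
    changed = True
    while changed:
        changed = False
        for edge in edges:
            missing = edge - infected
            if len(missing) == 1:
                infected |= missing
                changed = True
    return len(infected) == num_vertices


def min_percolating_size(n, d, t, r, intervals_only):
    """Smallest size of a percolating set, by brute force over all subsets."""
    vertices = list(product(range(1, n + 1), repeat=d))
    edges = box_edges(n, d, t, r, intervals_only)
    for size in range(len(vertices) + 1):
        for subset in combinations(vertices, size):
            if percolates(edges, subset, len(vertices)):
                return size
    return None
```

For example, `min_percolating_size(3, 2, 2, 2, False)` can be compared with `theorem1_value(3, 2, 2, 2)`, and likewise with `intervals_only=True` for the grid version.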

The special case $r = d$ of this result was proved over 25 years ago by Alon [1] using
techniques from exterior algebra. In this case $U$ is simply the set of points in which
at least one coordinate is in the range 1 up to $t - 1$.

Corollary 2 ([1, Theorem 3.4]). For every $n \ge t \ge 2$ and $d \ge 1$ we have

$$m(\mathcal{K}(n,d,t,d)) = m(\mathcal{P}(n,d,t,d)) = n^d - (n+1-t)^d.$$

We remark that Alon's Theorem was phrased in terms of edge percolation in complete
multi-partite hypergraphs¹, and was somewhat more general than Corollary 2. Indeed, given a
vector $\mathbf{t} = (t_1,\dots,t_d) \in \mathbb{N}^d$, let $\mathcal{K}^*(n,d,\mathbf{t})$ denote the
hypergraph on $[n]^d$ whose edges are copies of $K_{t_1} \times \cdots \times K_{t_d}$ which have
'length' $t_j$ in direction $j$ for each $1 \le j \le d$. That is, those copies which sit 'as we
expect' and with a prescribed orientation. In [1], Alon determined
$m(\mathcal{K}^*(n,d,\mathbf{t}))$ for every $n, d \in \mathbb{N}$ and $\mathbf{t} \in \mathbb{N}^d$;
thus Theorem 1 can be seen as the natural analogue of Alon's Theorem when we allow the
copies of $K_t^r$ to be oriented in any direction. In Section 3 we present a result that
generalizes both Alon's result and Theorem 1.

¹ Translating from edges in a hypergraph to vertices in the line graph maps his result to
Corollary 2.

Before turning to the proof of Theorem 1, let us give a little context. The first
extremal result related to bootstrap percolation was proved by Bollobás [8], and
phrased in the language of 'weakly saturated graphs'. This is the natural edge version
of the $H$-bootstrap percolation we have just defined (infect an edge if it is the last
uninfected edge of a copy of $H$), with $G$ complete. The main aim of [8] was to pose
a conjecture concerning the extremal number when $H = K_k$ and $G = K_n$. This
conjecture was proved by Alon [1], Frankl [11] and Kalai [15], using linear algebraic
methods.

The $\mathcal{H}$-bootstrap process is named after a closely related model, known as
$r$-neighbour bootstrap percolation, which was introduced in 1979 by Chalupa, Leath
and Reich [10] as a model of disordered magnetic systems. In this process, a vertex of
a graph $G$ becomes infected when it has at least $r$ infected neighbours, and infected
vertices remain infected forever. We remark that this is similar to $H$-bootstrap
percolation with $H$ a star, except that a given copy of $H$ can only be responsible for
infecting its central vertex; it is thus a special case of the natural 'directed' version of
$\mathcal{H}$-bootstrap percolation, in which each edge can infect only a single specified vertex.
The r-neighbour bootstrap process has been extensively studied by mathematicians 
and statistical physicists (see p±JEJ[II], for example, and the references therein). For 
further background see Bollobas [9]. 

In r-neighbour bootstrap percolation, one is mainly interested in estimating the 
critical threshold in the random setting: if the initially infected set A is formed by 
selecting vertices independently with probability p, for which p is it likely that even- 
tually all vertices are infected? In the study of this probabilistic question, extremal 
results turn out to be important (see [I] or [13], for example). One of our main mo- 
tivations in this work is to approach the following tantalizing open problem, which is 
our main stumbling block in attacking the probabilistic question on the hypercube. 
Let $m(G,r)$ denote the minimum size of a percolating set in $r$-neighbour bootstrap
percolation on $G$. In [3], Balogh and Bollobás made the following conjecture.

Conjecture 1. Let $r \ge 3$ be fixed. Then

$$m(Q_d, r) = \left(\frac{1}{r} + o(1)\right) \binom{d}{r-1}$$

as $d \to \infty$.

The upper bound in Conjecture 1 follows by taking a Steiner system at level $r$,
together with all of level $r - 2$. Amazingly, we know of no super-linear lower bound.
In the case $r = 2$ the situation is simpler, and $m(P_n^d, 2)$ is known exactly for all $n$
and $d$ (see [3] or [1]). At the other end of the range, Pete (see [7]) observed that
$m(P_n^d, d) = n^{d-1}$. However, for fixed $2 < r < d$, $m(P_n^d, r)$ is known only up to a
constant factor that depends on $d$.

Finally, we remark that the random questions are also interesting in the $H$-bootstrap
model, and that some of the basic problems (in the 'edge version') are solved in [5]
by the first three authors. As the reader might guess, however, there are still many
more open problems than theorems.

The rest of this note is arranged as follows. In Section 2 we prove Theorem 1, and
in Section 3 we discuss an inhomogeneous extension.

2. Proof of Theorem 1

The proof of Theorem 1 is based on the following observation.

Lemma 3. Let $\mathcal{H}$ be an arbitrary hypergraph. Suppose that we can find a vector space
$W$ spanned by vectors $\{f_v : v \in V(\mathcal{H})\}$ such that, for every edge
$S \in \mathcal{H}$, we have a linear dependence $\sum_{v \in S} \lambda_{S,v} f_v = 0$ with all
coefficients $\lambda_{S,v}$ non-zero. Then

$$m(\mathcal{H}) \ge \dim W.$$

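Proof. Once one thinks of the statement, the proof is essentially immediate. Indeed,
suppose that $A \subset V(\mathcal{H})$ percolates in the $\mathcal{H}$-process. Then we can order
the vertices $v_1,\dots,v_\ell$ in $V(\mathcal{H}) \setminus A$ so that each $v_i$ is the only
vertex of some edge $S_i$ of $\mathcal{H}$ that is not in $A_{i-1}$, where
$A_i = A \cup \{v_j : j \le i\}$ (so $S_i \subset A_i$). Let $W_i$ be the span of the vectors
$\{f_v : v \in A_i\}$. For each $i \ge 1$, the dependency condition for $S_i$ and the fact that
all other vertices of $S_i$ are in $A_{i-1}$ together imply that $f_{v_i}$ is a linear
combination of vectors in $W_{i-1}$. Indeed, there exist $\lambda_{S_i,v}$, one for each
$v \in S_i$ and all non-zero, such that

$$\sum_{v \in S_i} \lambda_{S_i,v} f_v = 0, \quad\text{and so}\quad
f_{v_i} = -\frac{1}{\lambda_{S_i,v_i}} \sum_{v \in S_i \setminus \{v_i\}} \lambda_{S_i,v} f_v \in W_{i-1}.$$

Thus $W_i = W_{i-1}$ and hence $W_\ell = W_0$. By assumption, $A_\ell = V(\mathcal{H})$, so
$W_\ell = W$. Since $W_0$ is spanned by $|A|$ vectors, we have $|A| \ge \dim W$. □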

To prove Theorem 1 we must find the right vectors. Since the notation in the
formal proof may perhaps obscure the ideas, we first outline some special cases.
Given $v \in [n]^d$ we write $|v|$ for the sum of the coordinates of $v$.

Call a coordinate $i$ of a vector $v$ large if $v_i \ge t$ and small otherwise, and let

$$U = \{(u_1,\dots,u_d) \in [n]^d : |\{i : u_i \ge t\}| \le r-1\}$$

be the set of all $v \in [n]^d$ with at most $r - 1$ large coordinates, as in (1). We begin
by assigning to each vertex $u \in U$ an abstract vector $e_u$, and assume that the vectors
$\{e_u : u \in U\}$ are linearly independent.

Suppose first that $t = 2$ and $r = d$, so the edges $S$ of our hypergraph $\mathcal{H}$ are
hypercubes $Q_d$ of full dimension in $[n]^d$, and $U = \{x \in [n]^d : \min_j x_j = 1\}$ is a
union of $(d-1)$-dimensional faces of $[n]^d$. Now, for each $v \in [n]^d$ and $k \in [d]$, let
$\pi_k(v)$ denote the projection of $v$ onto the face $\{x \in [n]^d : x_k = 1\}$, and set

$$f_v = \sum_{k=1}^{d} e_{\pi_k(v)}.$$

We claim that the dependency condition in Lemma 3 holds with $\lambda_{S,v} = \pm 1$ for each
$S$ and $v$, simply by letting the sign alternate over the vertices of the cube $S$ in the
obvious way. To see this, consider the contribution to

$$\sum_{v \in S} \lambda_{S,v} f_v = \sum_{v \in S} \lambda_{S,v} \sum_{k=1}^{d} e_{\pi_k(v)}
= \sum_{k=1}^{d} \sum_{v \in S} \lambda_{S,v}\, e_{\pi_k(v)}$$

from terms with a given value of $k$. The vertices of $S$ can be grouped into pairs $\{v, v'\}$
differing only in the $k$th coordinate, and so with $\pi_k(v) = \pi_k(v')$. The choice of sign
ensures that the corresponding contributions to the sum cancel. One must also check
that the vectors $\{f_v : v \in [n]^d\}$ have the same span as the vectors $\{e_u : u \in U\}$;
this follows from the fact that, for each $v \in U$, $f_v$ is equal to $c_v e_v$ plus a sum of
terms involving $e_u$ with $|u| < |v|$, where $c_v > 0$ is the number of small coordinates of $v$.
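The two facts used in this special case (the alternating-sign dependency on every cube, and the span condition) can be checked mechanically for small $n$ and $d$. The sketch below is an illustration under assumptions, not the authors' code: the names `f`, `cube_dependency` and `rank` are hypothetical, vectors are stored as integer coordinate lists with respect to the basis $(e_u : u \in U)$, and the rank is computed exactly over the rationals.

```python
from fractions import Fraction
from itertools import product


def rank(rows):
    """Exact rank of a list of integer rows, by Gaussian elimination."""
    m = [[Fraction(x) for x in row] for row in rows]
    rk = 0
    for col in range(len(m[0])):
        pivot = next((i for i in range(rk, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for i in range(rk + 1, len(m)):
            factor = m[i][col] / m[rk][col]
            m[i] = [a - factor * b for a, b in zip(m[i], m[rk])]
        rk += 1
    return rk


n, d = 3, 2                                   # small illustrative parameters
points = list(product(range(1, n + 1), repeat=d))
U = [x for x in points if min(x) == 1]        # t = 2, r = d: some coordinate equals 1
index = {u: i for i, u in enumerate(U)}


def f(v):
    """Coordinates of f_v = sum_k e_{pi_k(v)} in the basis (e_u : u in U)."""
    vec = [0] * len(U)
    for k in range(d):
        proj = v[:k] + (1,) + v[k + 1:]       # pi_k(v): set coordinate k to 1
        vec[index[proj]] += 1
    return vec


def cube_dependency(sides):
    """Signed sum of f_v over the cube I_1 x ... x I_d, each I_k = (low, high).

    The sign of a vertex is (-1)^(number of coordinates at the high value).
    """
    total = [0] * len(U)
    for choice in product((0, 1), repeat=d):
        v = tuple(sides[k][c] for k, c in enumerate(choice))
        sign = (-1) ** sum(choice)
        total = [a + sign * b for a, b in zip(total, f(v))]
    return total


pairs = [(a, b) for a in range(1, n + 1) for b in range(a + 1, n + 1)]
all_dependencies_vanish = all(
    not any(cube_dependency(sides)) for sides in product(pairs, repeat=d)
)
span_rank = rank([f(v) for v in points])
```

Here `all_dependencies_vanish` records whether the signed sum is zero on every cube, and `span_rank` should equal $|U| = n^d - (n-1)^d$ if the span condition holds.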

The case $t = 2$ and general $1 \le r \le d$ is not much harder: now $U$ consists of the
$(r-1)$-dimensional faces $\{x \in [n]^d : x_{j_1} = \cdots = x_{j_{d-r+1}} = 1\}$, and we define
$\pi_T(v)$ to be the projection onto the face indexed by $T = \{j_1,\dots,j_{d-r+1}\}$, and set

$$f_v = \sum_{T} e_{\pi_T(v)},$$

where the sum runs over all sets $T \subset [d]$ of size $d - r + 1$. Setting
$\lambda_{S,v} = \pm 1$ for each $S$ and $v$, exactly as before, we can again group the vertices
of $S$ into pairs that project to the same point; the proof is now exactly as above.

Now suppose that $r = d$, so we are back to projecting out single coordinates, but
that $t \ge 3$. In order to define the vectors $f_v$, consider first a single coordinate $k$, and
a line $v(1),\dots,v(n)$ of points of $[n]^d$ differing only in the $k$th coordinate, with $v(i)$
having $k$th coordinate $i$. We shall set

$$f_v = \sum_{k=1}^{d} f_v^{(k)},$$

where each $f_{v(i)}^{(k)}$ is a linear combination of the vectors $e_{v(1)},\dots,e_{v(t-1)}$.
For those $v(i)$ with $k$th coordinate small (i.e., $i \le t - 1$), we just take
$f_{v(i)}^{(k)} = e_{v(i)}$; for those with $k$th coordinate large, we want to choose the linear
combinations such that the following holds:

For any $i_1 < \cdots < i_t$, the vectors $f_{v(i_1)}^{(k)},\dots,f_{v(i_t)}^{(k)}$ are
'minimally' dependent, (2)

meaning that they satisfy a linear dependence with all coefficients non-zero. We
remark that the vectors $v(i_1),\dots,v(i_t)$ will be $t$ points in a line in an edge of
$\mathcal{H}$, i.e., a copy of $K_t^r$. It is clearly possible to choose the linear combinations
so that (2) holds, simply by choosing the linear combinations to be 'in general position'.
Now, given an edge $S$ of $\mathcal{H}$, we take the coefficients of these dependencies (which we
take to be the same for all lines in direction $k$) as one factor contributing to
$\lambda_{S,v}$; there is a similar factor for each coordinate $k$. (Note that this is in fact
exactly what we did in the case $t = 2$: there, for a given $k$, each $f_{v(i)}^{(k)}$ is simply
$e_{v(1)}$, and the coefficients of our dependency are $\pm 1$.) The proof now follows as before.

In the fully general case we sum over all projections onto 'thickened' $(r-1)$-dimensional
faces. When projecting out $d - r + 1 \ge 2$ coordinates, we choose coefficients for the
$(t-1)^{d-r+1}$ allowed image vectors by multiplying the coefficients associated to
projecting out a single coordinate; a formal description follows.

Proof of Theorem 1. Fix $n \ge t \ge 2$ and $d \ge r \ge 1$, and set
$\mathcal{K} = \mathcal{K}(n,d,t,r)$ and $\mathcal{P} = \mathcal{P}(n,d,t,r)$. As above, for
$v = (v_1,\dots,v_d) \in [n]^d$ we say that coordinate $i$ of $v$ is large if $v_i \ge t$ and
small otherwise. As in (1), let $U$ be the set of all $v \in [n]^d$ with at most $r - 1$ large
coordinates, and note that

$$|U| = \sum_{s=0}^{r-1} \binom{d}{s} (t-1)^{d-s} (n-t+1)^s.$$

Suppose that the set of initially infected vertices is exactly $U$. Then every vertex $v$
is eventually infected in the $\mathcal{P}$-process. Indeed, for $v \notin U$ we can use any $r$
large coordinates of $v$ to construct a copy $H$ of $P_t^r$ in $P_n^d$ with $v$ as the 'top'
vertex, i.e., with $|u| < |v|$ for all other vertices $u$ of $H$. It follows by induction on
$|v|$ that all $v$ are infected eventually. Hence, since $\mathcal{P} \subset \mathcal{K}$,

$$m(\mathcal{K}) \le m(\mathcal{P}) \le |U|. \qquad (3)$$
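The induction behind (3) can be replayed directly: process the vertices in order of coordinate sum and, for each vertex outside $U$, check that the grid box having it as 'top' vertex is otherwise already infected. The following Python sketch is illustrative only; the function name `u_percolates_in_grid` and the choice of the first $r$ large coordinates are assumptions of the sketch (the text allows any $r$ large coordinates).

```python
from itertools import product


def u_percolates_in_grid(n, d, t, r):
    """Replay the induction on |v|; return (success flag, |U|)."""
    points = sorted(product(range(1, n + 1), repeat=d), key=sum)
    infected = {v for v in points if sum(x >= t for x in v) <= r - 1}   # the set U
    size_u = len(infected)
    for v in points:                       # increasing coordinate sum |v|
        if v in infected:
            continue
        # v is outside U, so it has at least r large coordinates; use the first r.
        large = [k for k in range(d) if v[k] >= t][:r]
        sides = [range(v[k] - t + 1, v[k] + 1) if k in large else (v[k],)
                 for k in range(d)]
        box = set(product(*sides))         # an edge of P(n,d,t,r) with top vertex v
        if not (box - {v}) <= infected:
            return False, size_u
        infected.add(v)
    return True, size_u
```

For instance, `u_percolates_in_grid(4, 3, 2, 2)` replays the argument on $[4]^3$ with $t = 2$ and $r = 2$, where $|U| = 1 + 3 \cdot 3 = 10$.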

For the lower bound let $W$ be a (real) vector space with basis $\{e_u : u \in U\}$, so
$\dim W = |U|$. (Here the $e_u$ are simply abstract linearly independent vectors.) By
Lemma 3 it suffices to define vectors $\{f_v : v \in [n]^d\}$ with $f_v \in W$ in such a way that
(i) $\{f_v\}$ spans $W$ and (ii) for every $S \subset [n]^d$ which is an edge of
$\mathcal{K}(n,d,t,r)$ there are non-zero coefficients $\lambda_{S,v} \in \mathbb{R}$ such that

$$\sum_{v \in S} \lambda_{S,v} f_v = 0. \qquad (4)$$

Let us fix once and for all an $n \times (t-1)$ matrix
$M = (m_{ij})_{1 \le i \le n,\ 1 \le j \le t-1}$ with the following properties: the first $t - 1$
rows of $M$ form the identity matrix, all entries of $M$ are non-negative, and any $t - 1$ rows
of $M$ are linearly independent. Such matrices clearly exist: in constructing the next row
(after the first $t - 1$) we just avoid the union of a finite number of $(t-2)$-dimensional
subspaces. Since any $t$ rows of $M$ are dependent, but no $t - 1$ rows are, for any subset
$I \subset [n]$ with $|I| = t$ there are non-zero coefficients $(\lambda_{I,i})_{i \in I}$ such that

$$\sum_{i \in I} \lambda_{I,i}\, m_{ij} = 0 \quad \text{for each } j = 1, 2, \dots, t-1. \qquad (5)$$
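One concrete way to obtain such a matrix and the coefficients in (5) is sketched below. This is an assumption of the sketch rather than the paper's choice: the text only asks for rows 'in general position', whereas here the rows after the identity block are taken to be $(1, x, \dots, x^{t-2})$ for $x = 1, 2, \dots$, and the coefficients $\lambda_{I,i}$ are taken to be signed maximal minors. The names `build_matrix`, `det` and `dependency` are hypothetical, and the required properties are verified by brute force rather than assumed.

```python
from itertools import combinations


def det(m):
    """Determinant by Laplace expansion along the first row (tiny matrices only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def build_matrix(n, t):
    """n x (t-1) matrix: identity rows, then rows (1, x, ..., x^(t-2)), x = 1, 2, ..."""
    identity = [[int(i == j) for j in range(t - 1)] for i in range(t - 1)]
    extra = [[x ** j for j in range(t - 1)] for x in range(1, n - t + 2)]
    return identity + extra


def dependency(M, I):
    """Coefficients lambda_{I,i} of (5) for a t-set I of 1-based row indices.

    lambda_{I,i} is (up to sign) the determinant of the other t-1 rows, so it is
    non-zero exactly when those rows are linearly independent.
    """
    ordered = sorted(I)
    rows = [M[i - 1] for i in ordered]
    coeffs = [(-1) ** a * det(rows[:a] + rows[a + 1:]) for a in range(len(rows))]
    return dict(zip(ordered, coeffs))


n, t = 5, 3                                   # small illustrative parameters
M = build_matrix(n, t)
general_position = all(det([M[i] for i in rows]) != 0
                       for rows in combinations(range(n), t - 1))
relations_hold = all(
    all(c != 0 for c in dependency(M, I).values())
    and all(sum(c * M[i - 1][j] for i, c in dependency(M, I).items()) == 0
            for j in range(t - 1))
    for I in combinations(range(1, n + 1), t)
)
```

The minors give a valid dependency because the signed sum in (5) is the expansion of a $t \times t$ determinant with a repeated column.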

Given a point $v = (v_1,\dots,v_d) \in [n]^d$, a coordinate $1 \le k \le d$ and a 'value'
$j \in [n]$, let $\pi_k^j(v)$ denote the point $(v_1,\dots,v_{k-1},j,v_{k+1},\dots,v_d)$ obtained
by starting from $v$ and setting the value of the $k$th coordinate to $j$. Similarly, for
distinct $k_1,k_2,\dots,k_p \in [d]$ and arbitrary $j_1,\dots,j_p \in [n]$ let

$$\pi_{k_1,\dots,k_p}^{j_1,\dots,j_p}(v)
= \pi_{k_1}^{j_1}\big(\pi_{k_2}^{j_2}(\cdots \pi_{k_p}^{j_p}(v) \cdots)\big)$$

be the point obtained from $v$ by setting the value of coordinate $k_i$ to $j_i$ for
$1 \le i \le p$.

Let $P = \{k_1,\dots,k_p\}$ be any subset of $[d]$ with cardinality $p = d - r + 1$; these will
be the coordinates 'projected out'. For $v \in [n]^d$, define

$$f_v^{(P)} = \sum_{j_1=1}^{t-1} \cdots \sum_{j_p=1}^{t-1}
\Big( \prod_{a=1}^{p} m_{i_a j_a} \Big)\, e_{\pi_{k_1,\dots,k_p}^{j_1,\dots,j_p}(v)}, \qquad (6)$$

where $i_a = v_{k_a}$ is the original value of the $a$th coordinate which is projected out; we
shall see in a moment that this defines a vector in $W$. We set

$$f_v = \sum_{P} f_v^{(P)}, \qquad (7)$$

where the sum is over all $P \subset [d]$ with $|P| = p$.

We shall break up the remaining calculation into three simple claims.

Claim 1. $f_v \in W$ for each $v \in [n]^d$.

Proof. Let $v \in [n]^d$ and $P \subset [d]$ with $|P| = d - r + 1$, and suppose that $e_u$
appears in the sum in (6), with $u = \pi_{k_1,\dots,k_p}^{j_1,\dots,j_p}(v)$. Then the coordinates
which are projected out (the $k_a$) take values (the $j_a$) that are small. Since only
$d - p = r - 1$ coordinates are not projected out, $u$ has at most $r - 1$ large coordinates,
so $u \in U$, and $e_u \in W$. Thus $f_v^{(P)} \in W$ and hence $f_v \in W$. □

Claim 2. The vectors $\{f_v : v \in [n]^d\}$ span $W$.

Proof. Recall that $|v|$ denotes the sum of the coordinates of $v$. We shall show that
for every $v \in U$, there exist constants $c_v > 0$ and $\mu_{uv} \in \mathbb{R}$ such that

$$f_v = c_v e_v + \sum_{u \in U :\ |u| < |v|} \mu_{uv}\, e_u. \qquad (8)$$

To prove (8), note first that $m_{ij} \ne 0$ implies that $j \le i$, since the first $t - 1$ rows
of $M$ form the identity matrix, and so either $i = j \le t - 1$, or $i \ge t$ and $j \le t - 1$.
By (6), it follows that $f_v^{(P)}$ (and hence $f_v$) is a linear combination of vectors $e_u$
with $|u| \le |v|$, for every $v \in [n]^d$ and every $P \subset [d]$ with $|P| = d - r + 1$.
Moreover, since $u \le v$ holds coordinate-wise, we have $|u| < |v|$ unless $u = v$.

Now, suppose that $v \in U$. Then $v$ has at least $p$ small coordinates, so we can choose
some $P$ consisting only of small coordinates of $v$. Taking $j_a = i_a$ in each sum in (6),
we see that $e_v$ appears at least once in the formula defining $f_v^{(P)}$. Since we chose all
of the $m_{ij}$ to be non-negative, it follows that all our coefficients are non-negative. Thus
$f_v^{(P)}$, and hence $f_v$, has a strictly positive coefficient of $e_v$, as required.

From (8) it follows by elementary linear algebra that the vectors $\{f_v : v \in U\}$ span
$W$. Indeed, writing $\mathrm{span}(\cdot)$ for the linear span of a set of vectors, one can show
by induction on $s$ that for each $s$,

$$\mathrm{span}(\{f_v : v \in U,\ |v| \le s\}) = \mathrm{span}(\{e_v : v \in U,\ |v| \le s\}).$$

Hence the vectors $\{f_v : v \in [n]^d\}$ span $W$, as claimed. □

It remains only to establish the dependency condition (4); the first step is to define
the coefficients $\lambda_{S,v}$. For each edge $S$ of $\mathcal{K}(n,d,t,r)$, i.e., for each induced
copy of $K_t^r$ in $[n]^d$, let $D(S)$ denote the set of coordinates in which $S$ varies, so
$|D(S)| = r$. Moreover, for each $\ell \in D(S)$ let $I_\ell = I_\ell(S)$ denote the set of values
taken by the $\ell$-coordinates of points in $S$, so $|I_\ell| = t$. Now, for each
$v = (v_1,\dots,v_d) \in S$, set

$$\lambda_{S,v} = \prod_{\ell \in D(S)} \lambda_{I_\ell, v_\ell}, \qquad (9)$$

where the coefficients $\lambda_{I,i}$ are as in (5). Note that $\lambda_{S,v} \ne 0$ for all
$v \in S$.
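Claim 3. For each $S \in \mathcal{K}$ and each $P \subset [d]$ with $|P| = d - r + 1$, we have

$$\sum_{v \in S} \lambda_{S,v} f_v^{(P)} = 0. \qquad (10)$$

Proof. We shall establish (10) by partitioning the vertices of $S$ into lines, and showing
that the sum over the $t$ vertices $v$ in any such line $L$ is zero. To do so, recall that
$|P| = d - r + 1$ and $|D(S)| = r$, so $|P \cap D(S)| \ge 1$. Let $k \in P \cap D(S)$, and
partition the $t^r$ vertices of $S$ into $t^{r-1}$ lines of size $t$, i.e., sets of vertices in $S$
differing only in the $k$th coordinate.

Let $L$ be one of these lines, and observe that if $w \in L$ then
$L = \{\pi_k^i(w) : i \in I_k\}$. By (5), for every $1 \le j \le t - 1$ we have
$\sum_{i \in I_k} \lambda_{I_k,i}\, m_{ij} = 0$, and hence

$$\sum_{j=1}^{t-1} \sum_{i \in I_k} \lambda_{I_k,i}\, m_{ij}\,
e_{\pi_{k_1,\dots,k_{p-1},k}^{j_1,\dots,j_{p-1},j}(w)} = 0$$

for every $1 \le j_1,\dots,j_{p-1} \le t - 1$ and every $k_1,\dots,k_{p-1} \ne k$. It follows by
(6) that

$$\sum_{i \in I_k} \lambda_{I_k,i}\, f^{(P)}_{\pi_k^i(w)}
= \sum_{j_1=1}^{t-1} \cdots \sum_{j_{p-1}=1}^{t-1} \Big( \prod_{a=1}^{p-1} m_{i_a j_a} \Big)
\sum_{j=1}^{t-1} \sum_{i \in I_k} \lambda_{I_k,i}\, m_{ij}\,
e_{\pi_{k_1,\dots,k_{p-1},k}^{j_1,\dots,j_{p-1},j}(w)} = 0, \qquad (11)$$

where $P = \{k_1,\dots,k_p\}$ with $k_p = k$, and (for each $a < p$) $i_a$ is the $k_a$th
coordinate of $w$, and hence of $\pi_k^i(w)$, since $k_a \ne k$.

Finally observe that, in the product (9), as $v$ ranges over the line $L$ only the factor
corresponding to coordinate $k$ varies. Thus

$$\sum_{v \in L} \lambda_{S,v} f_v^{(P)}
= \sum_{i \in I_k} \lambda_{S,\pi_k^i(w)}\, f^{(P)}_{\pi_k^i(w)}
= \Big( \prod_{\ell \in D(S),\ \ell \ne k} \lambda_{I_\ell, w_\ell} \Big)
\sum_{i \in I_k} \lambda_{I_k,i}\, f^{(P)}_{\pi_k^i(w)} = 0. \qquad (12)$$

Summing (12) over all $t^{r-1}$ lines $L$ proves the claim. □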

Finally, observe that (4) follows immediately from Claim 3 by summing over $P$.
Combining (4) with Claims 1 and 2, and applying Lemma 3, it follows that

$$m(\mathcal{K}) \ge |U| = \sum_{s=0}^{r-1} \binom{d}{s} (t-1)^{d-s} (n-t+1)^s,$$

as required. Together with (3), this completes the proof of Theorem 1. □
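The whole construction can be checked numerically for small parameters. The sketch below is an illustration under stated assumptions, not the authors' code: it uses one particular matrix $M$ (identity rows followed by rows $(1, x, \dots, x^{t-2})$, which the paper does not prescribe), takes the coefficients in (5) to be signed maximal minors, and uses hypothetical names (`check`, `lam`, `f`). It builds $f_v$ from (6) and (7), tests the dependency (4) with the coefficients (9) on every edge of $\mathcal{K}(n,d,t,r)$, and computes the rank of $\{f_v\}$ exactly.

```python
from fractions import Fraction
from itertools import combinations, product
from math import comb


def det(m):
    """Determinant by Laplace expansion (adequate for tiny integer matrices)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def rank(rows):
    """Exact rank via Gaussian elimination over the rationals."""
    m = [[Fraction(x) for x in row] for row in rows]
    rk = 0
    for col in range(len(m[0])):
        pivot = next((i for i in range(rk, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for i in range(rk + 1, len(m)):
            factor = m[i][col] / m[rk][col]
            m[i] = [a - factor * b for a, b in zip(m[i], m[rk])]
        rk += 1
    return rk


def check(n, d, t, r):
    """Return (every dependency (4) holds, rank of {f_v}, |U|)."""
    p = d - r + 1
    # Assumed choice of M: identity rows, then rows (1, x, ..., x^(t-2)), x = 1, 2, ...
    M = [[int(i == j) for j in range(t - 1)] for i in range(t - 1)]
    M += [[x ** j for j in range(t - 1)] for x in range(1, n - t + 2)]

    def lam(I):
        """Signed maximal minors of the rows indexed by I: coefficients for (5)."""
        rows = [M[i - 1] for i in I]
        return {i: (-1) ** a * det(rows[:a] + rows[a + 1:]) for a, i in enumerate(I)}

    points = list(product(range(1, n + 1), repeat=d))
    U = [v for v in points if sum(x >= t for x in v) <= r - 1]
    index = {u: i for i, u in enumerate(U)}

    def f(v):
        """f_v from (6) and (7), as coordinates in the basis (e_u : u in U)."""
        vec = [0] * len(U)
        for P in combinations(range(d), p):
            for js in product(range(1, t), repeat=p):
                coeff, u = 1, list(v)
                for k, j in zip(P, js):
                    coeff *= M[v[k] - 1][j - 1]
                    u[k] = j
                vec[index[tuple(u)]] += coeff
        return vec

    fs = {v: f(v) for v in points}
    values = range(1, n + 1)
    ok = True
    for D in combinations(range(d), r):        # coordinates in which the edge varies
        options = [list(combinations(values, t)) if k in D else [(a,) for a in values]
                   for k in range(d)]
        for sets in product(*options):
            lams = {k: lam(sets[k]) for k in D}
            total = [0] * len(U)
            for v in product(*sets):
                coeff = 1
                for k in D:
                    coeff *= lams[k][v[k]]      # the product in (9)
                ok = ok and coeff != 0
                total = [a + coeff * b for a, b in zip(total, fs[v])]
            ok = ok and not any(total)
    return ok, rank(list(fs.values())), len(U)


def theorem1_value(n, d, t, r):
    """Right-hand side of Theorem 1."""
    return sum(comb(d, s) * (t - 1) ** (d - s) * (n + 1 - t) ** s for s in range(r))
```

A call such as `check(3, 3, 2, 2)` returns a triple whose last two entries can be compared with `theorem1_value(3, 3, 2, 2)`.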
3. An inhomogeneous extension

In Theorem 1 we took the dimensions (that is, the side-lengths) of our 'host graph'
$K_n^d$ or $P_n^d$ to be all equal to $n$ purely for notational convenience. The proof is
unaltered if we replace $[n]^d$ by $[n_1] \times \cdots \times [n_d]$; we simply take a matrix $M$
with at least $\max_k n_k$ rows, and define $f_v$ exactly as before.

In fact, there is no need to use the same matrix for different coordinate axes: the
proof still works if we use different matrices $M^{(k)}$ for different coordinates
$1 \le k \le d$. This allows us to consider graphs $H$ that are grids with different dimensions
in different coordinates. However, when $r < d$, we must be careful: we must fix numbers
$t_1,\dots,t_d$ and consider $r$-dimensional grids $H$ with the property that if coordinate $k$
varies in $H$ then it takes $t_k$ distinct values. If this seems unnatural, then the reader
should consider only the case $r = d$, which corresponds to the result of Alon [1]
mentioned earlier.

Define $\mathcal{K} = \mathcal{K}(n_1,\dots,n_d,t_1,\dots,t_d,r)$ to be the hypergraph with vertex
set $[n_1] \times \cdots \times [n_d]$ in which the hyperedges are all sets $S$ of the form
$S = I_1 \times \cdots \times I_d$ with $I_k \subset [n_k]$ for each $k$, and either $|I_k| = 1$ or
$|I_k| = t_k$, with the second case occurring for exactly $r$ coordinates $k$. Define
$\mathcal{P} \subset \mathcal{K}$ similarly, but with the extra restriction that each $I_k$ is an
interval. Finally, let $U$ be the set of vertices $v = (v_1,\dots,v_d)$ in which at most
$r - 1$ coordinates $v_k$ satisfy $v_k \ge t_k$. The following theorem says that the set $U$ is
extremal in both $\mathcal{P}$ and $\mathcal{K}$.

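Theorem 4. Let $d \ge r \ge 1$ and let $n_k \ge t_k \ge 2$ for each $1 \le k \le d$. Then

$$m(\mathcal{K}) = m(\mathcal{P}) = \sum_{J \subset [d],\ |J| \le r-1}
\Big( \prod_{k \in J} (n_k + 1 - t_k) \Big) \Big( \prod_{k \notin J} (t_k - 1) \Big).$$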

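Proof. The theorem follows by a simple adaptation of the proof of Theorem 1 above,
taking the $k$th coordinate of $v = (v_1,\dots,v_d)$ to be large if $v_k \ge t_k$. In place of
the matrix $M$, we choose an $n_k \times (t_k - 1)$ matrix $M^{(k)} = (m^{(k)}_{ij})$ for each
coordinate $k$, and replace (6) by

$$f_v^{(P)} = \sum_{j_1=1}^{t_{k_1}-1} \cdots \sum_{j_p=1}^{t_{k_p}-1}
\Big( \prod_{a=1}^{p} m^{(k_a)}_{i_a j_a} \Big)\, e_{\pi_{k_1,\dots,k_p}^{j_1,\dots,j_p}(v)}.$$

The remainder of the proof is identical. □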